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ABSTRACT 

Hfq is a posttranscriptional riboregulator and RNA 
chaperone that binds small RNAs and target mRNAs 
to effect their annealing and message-specific regu- 
lation in response to environmental stressors. 
Structures of Hfq-RNA complexes indicate that 
U-rich sequences prefer the proximal face and 
A-rich sequences the distal face; however, the 
Hfq-binding sites of most RNAs are unknown. 
Here, we present an Hfq-RNA mapping approach 
that uses single tryptophan-substituted Hfq 
proteins, all of which retain the wild-type Hfq struc- 
ture, and tryptophan fluorescence quenching (TFQ) 
by proximal RNA binding. TFQ properly identified 
the respective distal and proximal binding of A15 
and Ue RNA to Gram-negative Escherichia coli (Ec) 
Hfq and the distal face binding of (AA)3A, (AU)3A and 
(AC)3A to Gram-positive Staphylococcus aureus (Sa) 
Hfq. The inability of (GU)3G to bind the distal face of 
Sa Hfq reveals the (R-L)r, binding motif is a more 
restrictive (A-L)r, binding motif. Remarkably Hfq 
from Gram-positive Listeria monocytogenes (Lm) 
binds (GU)3G on its proximal face. TFQ experiments 
also revealed the Ec Hfq (A-R-N)n distal face-binding 
motif should be redefined as an (A-A-N)n binding 
motif. TFQ data also demonstrated that the 5'-un- 
translated region of hfq mRNA binds both the prox- 
imal and distal faces of Ec Hfq and the unstructured 
G-terminus. 

INTRODUCTION 

Hfq is a pleiotropic posttranscriptional regulator found in 
many Gram-negative and Gram-positive bacteria (1-5). 
Hfq was first identified as the Escherichia coli (Ec) host 
factor necessary for replication of the RNA bacteriophage 
QP (6,7) but its more significant role in bacterial physi- 
ology has become clear over the past decade (8,9). Hfq 



acts as an RNA chaperone by binding to non-coding 
small RNAs (sRNAs) and their target mRNAs, 
facilitating the annealing of their complementary trans- 
encoded sequences and effecting message-specific regula- 
tion (10-16). Hfq appears to down-regulate most targeted 
messages; however, some mRNAs, such as rpoS, which 
encodes the stationary phase sigma factor a^, require 
Hfq for efficient translation (17). Hfq regulates multiple 
pathways involved in stress responses, membrane integ- 
rity, quorum sensing and virulence (3,18-21). This 
protein is an essential virulence factor in Ec, Salmonella 
typhimurium (St), Pseudomonas aeruginosa, Vibrio 
cholerae and other pathogenic bacteria (20,22-25). 
Recently, Hfq has been found to be involved in the 
multidrug resistance mechanisms of Ec and Salmonella 
enterica serovar typhimurium (26,27). This resistance is 
conferred by Hfq regulation of the expression of the 
AcrAB multidrug efflux pump in Ec (26) and the SmvA 
efflux pump in S. enterica (27). Hfq has also been shown 
to autorepress its expression by binding to at least two 
sequences, site A and site B, in the 5'-untranslated 
region (5'-UTR) of hfq mRNA (28-31). 

Currently two non-exclusive models of Hfq function in 
riboregulation are supported by experimental data (17). 
One emphasizes the chaperone activity of Hfq, which 
actively unfolds the secondary structures of mRNAs or 
sRNAs to facihtate sRNA-mRNA annealing or to alter 
access to the ribosome binding site. The second model 
assumes that Hfq binds to mRNA and sRNA at the 
same time, thereby increasing the local concentration 
and facihtating anneahng. 

The structures of full-length Staphylococcus aureus (Sa) 
Hfq and the conserved core of Ec Hfq (residues 2-69), as 
well as those of the Hfq proteins from other bacteria, have 
revealed a hexameric toroid and an Sm fold (32,33). Each 
subunit consists of a highly bent five-stranded antiparallel 
P sheet whereby p strands 1 through 3 comprise the Sml 
motif and p strands 4 and 5, the Sm2 motif In most 
eubacteria, an N-terminal a-helix sits on one side of 
each subunit. This helix-containing face is designated the 
proximal face; the opposite side is called the distal face. 
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Interestingly, Hfq proteins contain a highly variable 
C-terminus, e.g. Ec Hfq has a 44-residue-long 
C-terminal tail beyond its Sm core, whereas Sa Hfq has 
<10 residues beyond the Sm core (32). To date the 
C-terminus has not been visible in high-resolution 
crystal structures, most likely due to its inherent flexibihty 
(32,34). 

Several structures of Hfq bound to shorter RNA se- 
quences have been determined. The structure of Sa Hfq 
bound to an AU5G RNA showed the RNA oligomer 
bound in a circular manner in the pore on the proximal 
face (35). Other structures have been solved with RNA 
bound to the proximal face, two in which the Ec Hfq is 
bound to AUfiA (36,37) and the St Hfq-Ue complex (38). 
These structures reveal a similar, but non-identical, 
binding mechanism with a single nucleotide bound 
per subunit. The base of the nucleotide stacks between 
adjacent aromatic residues (F42 in Ec Hfq) and 
sequence specificity is conferred mainly by hydrogen 
bonds between the base and either side chain or 
backbone atoms of nearby residues. In the St Hfq-Ug 
structure, the RNA backbone adopts a constrained con- 
formation, which differs from that observed in Sa Hfq- 
AU5G complex (38). The former conformation allows 
specific recognition of the free terminal 3'-hydroxyl 
group, thus explaining the preferential binding of Hfq to 
U-rich 3' ends. The Ec Hfq-AUgA complexes show either 
an adenosine and three uridines or four uridines bound in 
the pore in a manner more similar to AU5G binding to Sa 
Hfq (36). These structures support the idea that U-rich 
RNA sequences, often found in sRNAs, will bind prefer- 
entially to the proximal face. 

Four structures of RNA bound to the distal face have 
also been determined. These are Ec Hfq-Ai5 (39), Bacillus 
subtilis (Bs) Hfq-(AG)3A aptamer (40), Sa Hfq-Ay (41) 
and the Ec Hfq-Ay-AUgA ternary complex (37). The Ec 
Hfq-Ai5 and Ec Hfq-A? complex structures show three 
nucleotides bound per subunit of Hfq and support the 
hypothesis that (A-R-N)n sequences, where A is an 
adenine nucleotide, R is any purine nucleotide and N is 
any nucleotide, bind to the distal face. The (A-R-N)n 
motif has been shown to be critical in the proper sRNA 
regulation of several mRNAs including rpoS, flilA and 
glmS (42-46). The A-site specificity of Ec Hfq is conferred 
by peptide backbone hydrogen bonding (Q33 in Ec Hfq). 
The R-site of Ec Hfq appears to be able to accommodate 
both adenosine and guanosine whereby the purine ring 
sticks into a pocket and packs against a series of 
aromatic and non-polar residues (Y25, L26, 130, L32 in 
Ec Hfq). The adenosine N3, N6, Nl and ribosyl 2'- 
hydroxyl atoms hydrogen bond to N5 of residue N28, 
Os atom of residue Q52, Oy of residue T61 and the 
carbonyl oxygen of residue G29, respectively. The N site 
has no protein-nucleic acid interactions and likely repre- 
sents the entrance or exit point for RNA. By contrast, the 
Bs Hfq-(AG)3A and Sa Hfq-(AA)3A complex structures 
show a significantly different distal-face binding mode, 
designated as the (R-L)n binding motif, where R is a 
purine nucleotide and L is the R site linker and can be 
any nucleotide (41,47). The (R-L)^ binding motif presents 
12 possible binding sites per hexamer versus the possibility 



of 18 binding sites of the (A-R-N)n motif found in the 
Gram-negative Ec Hfq. The presence of an (A-R-N)^ or 
an (R-L)„ binding mode can be attributed to subtle 
sequence differences between Hfq proteins from Gram- 
negative and Gram-positive bacteria (41). Consequently, 
the R sites of the Ec, Bs and Sa Hfqs are equivalent but 
not identical and the mode of protein-nucleobase stacking 
differs so that residues F25 and Q31 are stacking with 
adenines in the Sa Hfq-A4 structure. However, the 
altered sequences also preclude the formation of the 
A-site in the Bs or Sa Hfq proteins (41). 

In total, the published structures of these Hfq-RNA 
complexes establish unequivocally two RNA binding 
faces on Hfq and suggest where and how certain longer 
physiologically relevant RNA sequences may interact with 
Hfq. Yet, they do not provide a complete view of the 
binding modes of larger target mRNAs or sRNAs, 
which display a variety of secondary structures, e.g. 
DsrA, RhyB, MicA, OxyS and Spot42 (48). To help fill 
this gap a series of small angle x-ray scattering studies 
(SAXS) on full-length Hfq bound to intact sRNAs 
(34,49,50) have been carried out and suggested that a 
single RNA sequence will interact with Hfq on only one 
face of the hexamer at a time (49). In these SAXS struc- 
tures, the RNA sequences lie across the face of the 
molecule such that not all subunits contact the RNA; add- 
itionally the RNA does not appear to be bound circularly, 
as observed with the shorter RNA sequences used in 
crystal structures. Further, two SAXS studies suggest 
conflicting roles for the Hfq C-terminus, with one study 
concluding that this region may interact with RNA (50), 
whereas the other study sees no interaction (49). This in- 
dicates that further studies on the role of the C-terminus in 
RNA binding need to be conducted. 

One current impediment to understanding the full 
function of Hfq is the dearth of structural information 
on longer and physiologically relevant RNAs. To begin 
to address this problem we have developed a rapid, struc- 
ture-guided nucleobase-intrinsic tryptophan fluorescence 
quenching (TFQ) approach to map the region or regions 
to which an RNA sequence binds on Hfq and to identify 
hkely interacting amino acid residues. This methodology 
takes advantage of the lack of Trp residues in the Ec, Sa 
and Listeria monocytogenes (Lm) Hfq, our 'test' proteins, 
but clearly is not limited to only Hfq proteins that do not 
contain Trp residues. Thus, we have created multiple Ec, 
Sa and Lm Hfq proteins that contain a single engineered 
Trp residue, which has been placed throughout the entire 
length of the full-length Ec Hfq sequence, at selected 
proximal locations in full-length Sa Hfq and a distal side 
location in Lm Hfq. This technique allows us to identify 
correctly the binding locations of a series of 'control' se- 
quences to Ec, Sa and Lm Hfq, but more important, to 
redefine the Ec Hfq (A-R-N)n as well as the Sa Hfq (R-L)n 
distal-face binding motifs more precisely and to confirm 
that (A-A-N)n and (A-L)^ stretches bind the distal faces of 
Ec and Sa Hfq, respectively. We also explore the inter- 
actions between Ec Hfq with the 5'-UTR of its own 
mRNA to provide a fuller understanding of how the 
two previously identified binding sites (site A and site B) 
contact Hfq to effect its translational autoregulation. 
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Finally, with this set of Trp-substituted Hfq proteins we 
begin to evaluate whether or not the C-terminus of Ec Hfq 
interacts with any RNA. Our results demonstrate that in- 
trinsic TFQ is a valuable and readily utilizable tool for 
elucidating Hfq-RNA interaction in the absence of high- 
resolution structural information and can be combined 
with other lower resolution structural or biochemical 
data to define the Hfq-binding modes of short oligoribo- 
nucleotides and longer physiologically relevant RNA 
sequences. 

MATERIALS AND METHODS 

Protein over expression, purification and site-directed 
mutagenesis 

Ec, Sa and Lm Hfq were overexpressed in Ec strain 
ER2566 Ahfq using the pTYBll vector. The cells were 
grown in Luria Broth containing 50 fig/ml ampicilhn at 
37°C to an ODgoo between 0.4 and 0.6. Expression was 
induced with 0.5 mM isopropyl P-D-l-thiogalacto- 
pyranoside (IPTG) for 20 h at 18°C. Cells were harvested 
at 4°C and stored at — 80°C or lysed immediately using a 
microfluidizer. To remove contaminating RNA and DNA, 
10|.ig/ml DNAse and 10|.ig/ml RNAse were added to 
crude lysate and stirred at 4°C for 2h before clarification 
by centrifugation at 17 500 rpm for 30min at 4°C. All fuU- 
length Hfq proteins and the Ec Hfq truncate, which is 
composed of residues 2-69 (the core), were purified 
using the IMPACT-CN system as described (10). Wild 
type (WT) Ec and Sa Hfq and their tryptophan mutants 
were buffer exchanged into 20mM (4-(2-hydroxyethyl)-l- 
piperazineethanesulfonic acid) (HEPES)-Na, pH 8.0, 
150mM NaCl, 0.5 mM EDTA and concentrated from 
100 to 200 (hexamer). WT Lm Hfq and its Trp 
mutants were concentrated to 100 in 20 mM 
HEPES-Na, pH 8.0, 500 mM NaCl, 0.5 mM EDTA and 
stored at 25°C due to cold denaturation. WT Ec and Sa 
Hfq were stored at 4°C, whereas their mutants were stored 
at 25°C due to cold denaturation. 

Crystallization, data collection, structure determination 
and refinement 

All Hfq Trp mutants used for crystallization were C- 
terminally truncated at residue 69. Hfq Trp mutant 
crystals were obtained by the hanging-drop vapour diffu- 
sion method. Drops contained 1 ^il of protein to 1 |il of 
weU solution with a range of protein concentrations 
(5-9mg/ml). Proteins were crystallized from solutions 
containing 22-28% polyethlyene glycol (PEG) 3350, 26- 
32% isopropanol and 0.1 M Tris base, pH 8.0-9.0. X-ray 
intensity data were collected under cryo-conditions at the 
Advanced Photon Source (Beamline 22-ID or 22-BM) in 
Argonne, IL. Crystals were flash frozen directly from the 
crystallization drops in a nitrogen stream at 100 K. Data 
were processed using HKL-2000 or HKL-3000 (51). 
Structure determination via molecular replacement was 
carried out in Phaser (52) using apo Hfq (PDB ID: 
1HK9) as the search model. Structure building and refine- 
ment were carried out in Coot (53) and Phenix (54), re- 
spectively. Selected data collection, reduction and 



refinement statistics are shown in Supplementary Table 
SI. Root Mean Square Deviation (RMSD) calculations 
were done using secondary structure matching and Ca 
atom alignment via SSM superpose (55). 

Determination of Hfq-RNA binding affinity by 
fluorescence polarization 

The RNA binding affinities of Ec and Sa Hfq and the Trp 
mutants were determined using a fluorescence polariza- 
tion-based binding assay and a Pan Vera Beacon 2000 
instrument (Invitrogen, Madison, Wl, USA). Hfq was 
serially diluted into 100 |il of binding buffer containing 
20 mM HEPES-Na, pH 8.0, 0.5 niM EDTA, 200 mM 
NaCl and 1 nM 5'-fluorescein-labelled RNA. Samples 
were excited at 490 nm and emission was detected at 
530 nm and data were collected at 295 K. Data were 
analysed assuming a 1:1 binding stoichiometry between 
one Hfq hexamer and one molecule of RNA. The data 
were plotted using KaleidoGraph (Synergy Software) 
and the generated curves were fit using non-linear least 
squared analysis, assuming a biniolecular model such 
that the values represent the protein concentration at 
half maximal RNA binding (56). The binding isotherms 
were fit to the equation, P = {(-Pbound--Pfree) [protein]/ 
(A^D+[protein])}-l-_Pfree, where /"bound is the maximum po- 
larization at saturation, P is the polarization at a given 
protein concentration, /"free is the polarization of free 
fluorescein-labelled RNA and is the equihbrium dis- 
sociation constant. At a minimum aU values were inde- 
pendently determined in triplicate. 

Tryptophan fluorescence quenching 

TFQ measurements were performed using an RF-5301PC 
spectrofluorophotometer (Shimadzu, Nakagyo-ku, 
Kyoto, Japan) at 298 K. TFQ was done by exciting the 
single Hfq Trp residue at 298 nm and scanning the 
emission fluorescence spectrum from 320-400 nm. A 1 ml 
of sample containing 1 |iM of each Hfq protein in binding 
buffer (20 mM HEPES-Na, pH 8.0, 200 mM NaCl, 
0.5 mM EDTA) was scanned and followed by addition 
of a specific RNA sequence. Initially two concentration 
ranges were examined, 10 and 40 (iM RNA, to determine 
the overall sensitivity of the system. However, subsequent 
quenching studies were done using 1 and 4|iM RNA to 
increase the physiological relevance of the result and to 
decrease any ambiguity that might result from secondary, 
lower affinity binding. Each titration was done at 
least three times (Standard deviations are hsted in 
Supplementary Table S2). Data were analysed using 
Microsoft Excel. Quenching was determined using the ar- 
bitrary fluorescence maximal height for each Trp mutant. 
Quenching percentage was calculated using the following 
equation: 

(1 -((Fr-Fb)/(Fo-Fb))) X 100, where Fr is the fluor- 
escence value after addition of RNA to the Hfq solution, 
Fq is the initial fluorescence value of the Hfq solution 
without RNA and Fb is the fluorescence of buffer 
without RNA or Hfq. 



RNA samples and generation of the 5'-UTR of hfq mRNA 

Oligoriboiiucleotides were purchased from IDT 
(Coralville, lA) or Oligos Etc (Wilsonville, OR) and 
used without further purification. The 64-nucleotide 
5'-UTR of the hfq mRNA of Ec was synthesized biochem- 
ically. Briefly, the pMCSG7 vector was modified to 
include the DNA sequence 5'-TAA TAG GAC TCA 
CTA TAG GAT TTT TTC AGA ATC GAA AGG 
TTC AAA GTA CAA ATA AGC ATA TAA GGA 
AAA GAG AGA ATG GGA TCC-3' using standard 
ligation-independent cloning techniques. To prepare 
RNA, the modified vector was midiprepped, phe- 
nohchloroform extracted and 8-20 |ig of clean DNA 
vector was digested using BamHl-HF (NEB, Ipswich, 
MA) at 37°C for 16 h. The T7 RNA polymerase 
reaction [100 |il of lOx Buffer (0.5 M Tris base, pH 7.5, 
0.25 M MgCl2, 0.05 M EDTA), 40 ^1 of 50 mM spermi- 
dine, 200 (il of rNTP mix (20mM/rNTP), 30^1 of 100 
units/ml inorganic pyrophosphatase (NEB), 80^1 of 
1 mg/ml T7 RNA polymerase, 60 |il of 8-20 i-ig/ml 
digested DNA vector, 440|.il of dH20) was carried out 
at 37°C for 16h and quenched with 2ml of 0.45 M 
EDTA. The reaction was then purified by acidic phe- 
nohchloroform extraction and ethanol precipitated at 
-80°C for 16 h. The RNA pellet was further purified 
using 70% ethanol and air dried before resuspending in 
100 |il of lOmM sodium cacodylate, pH 6.5. 

RESULTS AND DISCUSSION 

TFQ identifies known binding faces 

Germane crystal structures demonstrate A15 RNA binds 
to the distal face of the Ec Hfq (39) and Ug RNA binds to 
the proximal face of Ec and St Hfq (37,38). These RNA 
sequences were therefore used as controls to test the cap- 
ability of TFQ to discern Ec Hfq distal face binding from 
proximal face binding, thus validating TFQ as an 
Hfq-RNA interaction-mapping technique. Using the Ec 
Hfq-Ai5 crystal structure as a guide, tryptophan muta- 
tions were introduced at positions Y25, K31, which is 
proximal to but does not interact with A15, and Q33, 
which interacts with A15 through its backbone carbonyl 
and amide groups (Figure 1). Similar contacts are seen in 
the Ec Hfq-Ay complex structure (37). These mutants were 
designed to allow us to confirm the general position in 
which Ai5 bound as well as refine the RNA binding site 
on the distal face. Similarly, we chose to make an F42W 
mutation on the opposite Hfq face to assess proximal face 
binding (Figure 1). An F39W mutation was made to in- 
vestigate a proximal face 'edge' binding site based on its 
recently reported importance in binding the sRNAs ArcZ 
and McaS [(57), http://dx.doi.Org/10.1016/j.jmb.2013.01. 
006] and its observed uridine-triphosphate binding (58). 
To explore potential binding to the positively charged 
cleft on the surface of Ec Hfq we chose to make an 
FllW mutation (Figure 1). On the basis of the identifica- 
tion of a lateral rim binding site for RhyB RNA to St Hfq 
(59) and that the lateral rim residues play a role in Hfq- 
RpoS binding (60) we used an R17W mutation, to study 
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Figure 1. Surface representation of the WT Ec Hfq structure and the 
positions of the tryptophan-substituted residues. Highhghted in red are 
residues on the distal face that were mutated to tryptophan (Trp); in 
yellow are the lateral residues that were mutated to Trp, in blue the 
proximal face residues that were mutated to Trp and in purple the 
position to indicate the beginning to the C-terminal 44 residues of Ec 
Hf A cartoon of the underlying hexamer is shown in grey. PDB ac- 
cession number for the WT Ec Hfq structure is 1HK9 (33). 



whether any of our selected RNAs bind to this site on Ec 
Hfq (Figure 1). Finally, as the role of the Hfq C-terminus 
in RNA binding remains unclear (43,61,62) but in specific 
cases necessary for mRNA regulation (43), we created 
three individual Trp mutations, G77W, Y83W and 
Q95W, in this unstructured region (Figure 1). All muta- 
tions were generated in the full-length Ec hfq sequence. 

TFQ experiments were carried out by titrating RNA 
into 1 |.iM protein (hexamer), which is ~ 10-fold below 
the reported cellular concentration of the Hfq hexamer 
(63), to reduce the major effects of non-specific and 
lower affinity binding. The samples were excited at 
298 nm and the emission spectra scanned froin 320- 
400 nm (Figure 2). After each titration was completed, 
the amount of TFQ was quantified. We define a significant 
Hfq-RNA interaction having occurred when the total 
quenching at 4\\M RNA concentration exceeds 10% 
and this quenching is 2-fold greater than the quenching 
at 1 RNA concentration. If quenching of Trp residues 
inserted individually on both faces meets these criteria 
then we would conclude the RNA Ugand is able to bind 
both faces. If one face is quenched at least 1.5-fold greater 
than the other, then that face is defined as being the 
preferred binding face. 

The initial TFQ experiments with A15 RNA showed 
that Ec Hfq mutants Y25W and Q33W were quenched 
significantly at the RNA concentrations examined 
(Figure 3 A, Supplementary Figure SI A). Some quenching 
was also observed with mutant FllW, suggesting a 
Hnkage between these sites, although structural artefacts 
of the FllW change might be responsible for this 
quenching (see below). Intriguingly, residue G77W 
showed small but significant quenching with A15, 
indicating that this C-terminal residue may interact with 



2740 Nucleic Acids Research, 2014, Vol. 42, No. 4 



F42W + U« 



I 
8 
I 

I ISO 

S 
S 

& 









Fq Buffer 

^— F42W FLUeOuM 

Fu — F42WFLU61UM 


— -77- 


\^ F42WFLU6 2uM 

>k F42WFLU6 3UM 

^ ^S. >^ F42WFLU6 4UM 























320 323 326 329 332 335 338 341 344 347 350353 356 359 362 365 368 371 374 377 380383 386 389 392 395 398 

Wavelength (nm) 

% Quenching = (1 - ((Fr-Fb) (Fq-Fb))) x 100 

Figure 2. Representative intrinsic TFQ titration experiment. To provide the initial value of tryptophan fluorescence, 1 |.iM Hfq mutant F42W in the 
absence of RNA was excited at 298 nm and the emission scanned from 320-400 nm. The maximum fluorescence intensity is found at 343 nm (denoted 
by a solid vertical red line). RNA quenching of the tryptophan fluorescence is calculated by measuring the intensity differences at wavelength 343 nm 
after addition of an RNA aliquot, employing the equation Quenching (%) = (1 — ((Fr— Fb)/(Fo— Fb))) x 100, and carrying out the appropriate 
corrections as described in the 'Materials and Methods' section. 



longer A-tracts in solution, possibly with a flipped out N- 
site nucleotide. As expected, neither proximal-face mutant 
F39W nor F42W is quenched by A15 in the 1^ |.iM range; 
however, they begin to quench in the less physiologically 
relevant 10-40 |iM range (Supplementary Figure SI A), 
indicating low affinity or non-specific binding at these 
high RNA concentrations. Ec Hfq mutants R17W, 
K31W, Y83W and Q95W are not quenched in the lower 
micromolar ranges, indicating no specific interaction 
occurs between these residues and A15. The finding that 
the Q33W but not the K31W residue is quenched by A15 is 
particularly important as those data indicate that RNA 
must be near the residue under study: residue K31W can 
be modelled to be as close as 6 A to the RNA but is not 
quenched, whereas residue Q33W can be modelled to 
stack directly against the adenine ring and is quenched 
(Supplementary Figure S2). 

Analogous TFQ experiments with Ue RNA show that 
Ec Hfq mutant F42W, located on the proximal face, is 
quenched significantly at aU RNA concentrations used, 
whereas mutant FllW has a fluorescence signal that 
is enhanced equally at aU concentrations (Figure 3A; 
Supplementary Figure SIB). No other mutant is 
quenched at the lower concentration ranges (l-4|.iM). In 
the 10^0 i-iM RNA range, the F39W mutant shows some 
quenching, although this is not as significant as that 



observed for Ec mutant F42W, and suggests a second, 
weaker affinity binding site, which is consistent with the 
quenching of the nearby Hfq 'rim' mutant S38W by DsrA 
and other RNA molecules (60), and that this residue binds 
RNA when a U-rich sequence is present at a high local 
concentration such as might occur in SgrS (64) 
(Supplementary Figure SIB). 

In addition to mapping the RNA binding sites on a 
Gram-negative Hfq homologue and to show the general 
utihty of this approach, we extended our studies to Hfq 
homologues from the Gram-positive bacteria Sa and Lm. 
Using the structure of the Sa Hfq-A4 complex (41) we 
created mutants F25W and Q31W. Due to the inability 
to produce Sa Hfq Y42W, a proximal face mutant, we 
chose to study the Lm Hfq after a sequence alignment 
of Sa and Lm Hfq revealed 40% sequence identity and 
81% sequence homology between the two proteins and 
that Lm residue F43 resides in the same location as Y42 
from Sa (Figure 3C). Additionally, Hfq has been shown to 
be functionally important in sRNA regulation in this 
pathogenic bacterium (65). Therefore mutation F43W 
was made in the 77 residue fuU-length Lm Hfq. As 
anticipated, Sa distal face mutants F25W and Q31W are 
quenched in the presence of A15 but are not quenched by 
Ue (Figure 3B). By contrast, proximal face mutant F43W 
is quenched by Ug but not quenched by A15 (Figure 3B). 
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Figure 3. Control TFQ experiments for Ec, Sa and Lm Hfq Trp mutants using A]5, a distal face-binding RNA, or U(,, a proximal face-binding 
RNA. Panels (A) (left) and (B) (left) show A]5 quenching of Ec and Sa/Lm Hfq Trp mutants, respectively. Panels (A) (right) and (B) (right) show 
quenching of Ec and Sa/Lm Hfq Trp mutants, respectively. The x-axis labels under each bar graph refer to the tryptophan-substituted residue within 
that Hfq protein. The percent quenching is shown on the y-axis. The bar graphs are coloured by location on Hfq as shown in Figure 1. The solid bar 
represents the percent quenching by 1 |.iM RNA while the diagonal striped bar above the solid bar represents the percent quenching by 4 |iM RNA. 
(C) A sequence alignment of the Sa and Lm Hfq proteins. The key proximal-face U6 binding residues Sa Y42 and Lm F43 are boxed. Identical 
residues between the two proteins are shown below the ahgnment. 



These results confirm that Trp residues show robust 
intrinsic fluorescence quenching when the appropriate 
RNA sequences are bound nearby at a distance <6A. 
To ensure further that the observed quenching is not 
simply an effect of non-specific quenching of Trp by 
RNA nucleotides, we titrated A] 5 or Ug into 6nM 
N-acetyl-L-tryptophanamide, a Trp residue mimic. 
Neither A15 nor Uf, induced quenching (Supplementary 
Figure S3), indicating that a specific interaction must 
occur to result in TFQ. Since significant quenching is 
observed only for residues that are known from crystal 
structures to interact directly with a particular RNA 
sequence, we conclude that TFQ can be used to map 
accurately RNA binding sites on Hfq. 

Tryptophan mutants do not create false binding sites 

To ensure that the designed mutations did not signifi- 
cantly impact the biochemical function and biologically 
relevant RNA binding sites of Hfq, fluorescence polariza- 
tion-based RNA binding assays were used to determine 
the equilibrium dissociation constants (Kj) of each 



mutant for the control RNA sequences (Supplementary 
Figure S4). It is important to emphasize here that due to 
the high protein and RNA concentrations necessary for 
each TFQ experiment, TFQ could not be used to deter- 
mine the equilibrium dissociation constants of a particular 
mutant for a particular RNA sequence. Calculated 
binding affinities are shown in Table 1. 

As anticipated, the bulkier Trp mutants can lower the 
binding affinity in some cases. Specifically, Ec Hfq mutant 
Y25W reduces the binding affinity of A15 by 184-fold 
(from 1.4 to 258 nM) and Q33W reduces the binding 
affinity of A15 by 12-fold (from 1.4 to 15.9nM). Sa Hfq 
mutant F25W reduces A15 binding by 2700-fold (from 
4.2 nM to 11.3^M) and Sa Q31W reduces A15 binding 
by 387-fold (from 4.2nM to 1.6 [iM). However, all other 
mutations do not affect A15 binding adversely (Table 1). 
We also observed that Ec Hfq mutant F42W reduces Ug 
binding affinity, but by only ~6-fold (from 766 nM to 
4.8 nM), whereas the other Trp-substituted residues 
show no significant effects on binding this ribooligo- 
nucleotide (Table 1). Interestingly, the affinity of Ug for 
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Table 1. Equilibrium dissociation constants (Kj) for Trp mutants to A15 and U(, 

A, 5 U, 



E. coli Hfq 



WT Hfq S 


1.4nM ± 0.9"-'' 


Fold change" 


766.0 nM ± 73.7 


Fold change 


FllW 


7.1 nM ± 0.6 


5.0 


502.7nM ± 55.5 


0.7 


R17W 


3.7nM ± 1.5 


2.6 


1.6^M ± 0.08 


2.1 


Y25W 


258.2 nM ± 32.7 


184.4 


836.8 nM ± 62.5 


1.1 


K31W 


1.4nM ± 0.5 


1.0 


1123.3 ± 73.0 


1.5 


Q33W 


15.9 nM ± 3.6 


11.4 


404.7 nM ± 182.4 


0.5 


F39W 


0.59 nM ± 0.35 


0.4 


2.0 |iM ± 0.4 


2.6 


F42W 


0.28 nM ± 0.09 


0.2 


4.8|.iM ± 782.5 


6.2 


G77W 


2.9nM ± 1.7 


2.1 


379.9 nM ± 93.2 


0.5 


Y83W 


7.4nM ± 2.8 


5.3 


777.9 nM ± 254.1 


1.0 


Q95W 


2.6nM ± 1.1 


1.9 

S. aureus Hfq 


848.3 nM ± 289.4 


1.1 


WT Hfq 


4.2nM ± 0.5'' 




69.8nM ± 7.0*^ 




F25W 


11.3 |xM ± 5.0 


2700 


41.9 nM ± 25.1 


0.6 


Q31W 


1.6|.iM ± 0.08 


386.7 


114nM ± 23.2 


1.6 



"Each value is the average of three individual experiments and the standard deviations. 
''Taken from (39). 

"Fold change is the ratio of the of the mutant Hfq divided by the of the WT Hfq. 
''Taken from (41). Values hsted in bold are all fold increases above 10.0. 



WT Ec Hfq, which to the best of our knowledge has not 
been reported previously, is ~30-fold lower than that 
observed for Ue binding to St Hfq (38). By contrast, the 
binding affinity of Ec Hfq for longer U-tracts is much 
stronger with a = 6.6 nM for Ec Hfq binding to Uie 
(Supplementary Figure S4C), which is similar to the pre- 
viously calculated Ec Hfq binding affinity for U18 (48). 
This indicates that the binding affinity for poly-U se- 
quences by Ec Hfq is strengthened significantly as the 
sequence is lengthened and suggests the recruitment of 
other residues such as F39 in binding. Our experimentally 
obtained value for the Ec Hfq-Ug binding is further sup- 
ported by the finding that when the same stock of 
fluoresceinated Ug is used, we obtain the same nanomolar 
binding affinity for Sa Hfq-Ug as has been reported pre- 
viously (35) and thus conclude that our experimentally 
determined for Ec Hfq-Ug is correct. 

Hence, we are able to conclude that mutations involved 
in RNA binding show a decreased binding affinity for 
those RNA sequences they bind; however, because the 
protein-RNA interaction is still observed via fluorescence 
polarization, this loss of affinity is not detrimental to our 
interpretation of the preferred location of RNA binding as 
ascertained by our TFQ experiments. We are cognizant 
that false negatives may occur, particularly when 
binding to WT Hfq is already weak and a particular tryp- 
tophan mutant lowers the affinity beyond the experimen- 
tal detection limits of TFQ. Hence the need to probe a 
potential RNA binding site by the creation and TFQ 
measurements of multiple tryptophan mutants in the 
area of interest, e.g. the Y25W, K31W and Q33W substi- 
tutions to measure distal face binding. Additionally, we 
find that Trp mutations proximal to a particular binding 
site do not affect the affinity of distant sites for their 
preferred RNA sequence. For example, the Ec Hfq 
F39W mutant binds A15 with a Kj = 0.6 nM, and the 



Ec Hfq Y25W mutant binds Ug with a = 836 nM: 
these are the same values observed for WT Hfq binding 
to these oligoribonucleotides (see Table 1). Thus, at worst, 
the tryptophan mutations lower the affinity of a few 
RNAs for their native binding sites, potentially resulting 
in a false negative binding result for a given single substi- 
tution; however, in no case do the Trp mutations create 
new RNA binding sites (false positives) for the well- 
studied 'control' sequences on either the Ec or Sa Hfq. 

Crystal structures of selected Trp mutants show minor 
structural perturbation 

To verify that the Trp mutations do not have a significant 
impact on the global structure of the resulting Ec Hfq 
protein, we determined the crystal structures of the 
FllW, Y25W, F39W and F42W substituted proteins in 
the context of the C-terminally truncated construct 
(residues 2-69) and compared their structures to the WT 
Ec Hfq structure determined by Sauter et al. (33) (PDB 
ID: 1HK9). Selected crystallographic data are provided in 
Supplementary Table SI. 

Comparison of mutants Y25W, F39W and F42W to 
WT Ec Hfq reveals Httle difference in the overall structure 
(Figure 4A), with the superposition of a single protomer 
yielding an RMSD< 1.05 A in all cases and a hexamer to 
hexamer RMSD< 1.20 A (Table 2). Superposition of the 
FllW mutant hexamer and the hexamer of the WT Hfq 
results in an RMSD of 2.48 A, indicating a change in the 
quaternary structure (Figure 4 A and C). This difference 
results from the significant shortening of two p strands 
due to the adjustment of the backbones of residues Q35 
to F39 on P2 and residues 159 to V62 on P5 that is caused 
by the presence of the bulkier indole side chain. These 
strands form part of the distal-side intersubunit interface, 
which is now perturbed in this tryptophan-substituted 
protein. The remodelling of the P2 and p5 strands and 
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Figure 4. (A) Overlays of the structures of WT Ec Hfq and distal (Y25W), lateral (FllW), proxiinal edge (F39W) and proximal pore (F42W) Hfq 
tryptophan mutants. All proteins are shown as cartoons with WT Hfq coloured cyan, Y25W coloured green, FllW coloured red, F39W coloured 
yellow and F42W coloured blue. The calculated RMSD (listed within the grey box) reveals that each mutation does not affect the protomeric 
structures significantly. (B) Two close up views of the overlay of the structures of the WT Ec Hfq bound to A15 (blue carbon sticks) and the (Y25W) 
distal face mutation (green carbon sticks) near the position of the substitution and R-site. Y25W takes two conformations, one of which occurs in 2 
out of 6 protomers (left). This conformation would block adenine insertion into the R-site (red box). The second conformation (right) occurs in 5 out 
of 6 protomers and allows base stacking with adenine. However the 2' oxygen of the ribose clashes with the indole ring (red arc), requiring 
adjustment of either the phosphodiester or polypeptide backbone or both to relieve the clash. These two structural problems are likely the cause 
for the significant reduction in A15 binding to this distal face mutant. (C) Close up of the area about the (32 and [55 strands after overlaying WT Ec 
Hfq and the FllW liexamers. The two major conforinational differences between the FllW protein and WT Hfq are enclosed within the red boxes 
with position 11 shown and numbered in the rightmost figure. 



Table 2. RMSD of superpositions of WT and mutant Ec Hfq 
protomers or hexamers 



WT:Mutant RMSD 




Monomer:Monomer 


Hexamer:Hexamer 


FllW 


0.30-1.05A 


2.48 A 


Y25W 


0.27-0.64 A 


0.72A 


F39W 


0.37-0.54 A 


1.00 A 


F42W 


0.31-0.52A 


1.20 A 



their altered interactions with the adjacent subunit is the 
likely origin of our FllW fluorescence quenching data. 

The only other structural difference of significance was 
found by the alignment of the Y25W mutant with WT Ec 
Hfq bound to A15. The observed difference is confined to 
the site of the mutation but provides a structural rationale 
for the significant change we observe in the of the 
Y25W protein for A15 (Figure 4B, Table 1). In the 
Y25W protein, W25 takes two conformations, one of 
which would block adenine insertion into the R-site and 
would require the indole ring to flip before adenosine 
could bind and a second conformation, which although 
would allow adenine-indole ring stacking, results in steric 
clash between the 2' ribose oxygen and the indole ring 
(Figure 4B). This would require the adjustment of either 
the RNA phosphodiester or protein polypeptide backbone 
to relieve the clash. Either alone or combined, these 



properties of residue W25 are the likely causes for the 
significant reduction in A15 binding to the distal face 
mutant but do not obliterate binding (Table 1). The struc- 
ture of the Ec Hfq (Y25W) protein bound to A15 will be 
necessary to obtain a complete molecular understanding 
of the diminished affinity of this protein. The structural 
changes observed in Y25W, and indeed FllW, would not 
have been anticipated from modelling studies and under- 
score the importance of having the high-resolution struc- 
tures of those mutants that have unusual TFQ properties 
and significantly lowered RNA binding affinities. 

Ec Hfq distal face binding is restricted to (A-A-N)n motifs 

The single-stranded A-tract RNA binding mode observed 
in the Ec Hfq-Ai5 complex structure suggested that the 
distal face could bind (A-R-N)n tracts, where A = an 
adenine nucleotide, R = any purine nucleotide and 
N = any nucleotide (39). This motif is not found in Hfq 
proteins from Gram-positive bacteria in that the A site is 
not present and the observed distal-face RNA binding 
motif was consistent with an (R-L)n motif, where 
R = any purine nucleotide and L = a linker nucleotide 
(41). Although the Ec Hfq R-site was posited to be able 
to bind guanine nucleotides, no experimental data con- 
firming such binding currently exists. Using our TFQ 
approach, we tested the hypothesis that both (A-R-N)n 
sequences, AAGAAGAAGAAGAAG, (A-A-G)5, and G 
GAGGAGGAGGAGGA, (G-G-A)5, would bind to the 
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distal face with guanine occupying the R-site by necessity 
in the latter sequence. Our results reveal that rather than 
binding to the same face, these sequences bind to different 
faces of Ec Hfq (Figure 5). (G-G-A)5 clearly interacts with 
residue Ec F42W on the proximal face and quenches none 
of the distal face tryptophan residues, supporting 
proximal face binding only (Figure 5A). By contrast, 
(A-A-Os quenches mutants FllW and Q33W but not 
mutants F39W or F42W, indicating distal face binding 
only (Figure 5B). Somewhat surprising at first was the 
weak quenching of R-site mutant Y25W by (A-A-G)5. 
However, given that this mutation can occlude the R- 
site resulting in a 180-fold reduction in binding affinity 
of Ai5, RNA sequences with intrinsically weaker binding 
affinity, such as (A-A-G)5, are likely to be unable to bind 
well even at 4 |iM and hence quench weakly this particular 
R-site substitution. Regardless, distal face binding by 
(A-A-G)5 is demonstrated by strong Q33W quenching 
and the ability of this substituted protein to bind an 
(A-A-N)4 tract with near WT Hfq affinity (39,44). 
Interestingly, C-terminal mutants, G77W, Y83W and 
Q95W are not quenched by (A-A-G)5, which differs 
from G77W quenching by (A-A-A)5 and suggests an al- 
ternative entrance/exit pathway for this A-rich polypurine. 
Further, in the presence of (G-G-A)5, residue Y83W 
quenches, albeit weakly, whereas the other C-terminal 
region residues do not, indicating a role for the 
C-terminal tail in binding to specific RNA sequences. 

These data reveal that the R-site is actually a physiolo- 
gically relevant adenine nucleotide-only binding site, and 
thus the Ec Hfq distal face binding motif is the more re- 
strictive (A-A-N)n motif rather than an (A-R-N)n motif. 
The (A-R-N)n distal face binding motif has been studied 
extensively, but until now it has not been possible to 
confirm that these sequences are binding to the expected 
face. By using TFQ we have been able to demonstrate that 
while sequences, which have an (A-R-N),, motif, do indeed 
bind to Hfq, they do not necessarily bind where expected. 
This conclusion is supported by the observation that the 
ARN-2 binding site of the glmS mRNA, which fulfills an 
(A-A-N)n motif, is the more functionally important distal- 
face binding site as compared with the ARN-1 site, which 
only fulfills an (A-R-N),, motif, binding site (43). These 
results also demonstrate that overall, C-terminal residues 
do not make strong interactions with (A-A-N)^ tracts but 
appear to interact better with longer (G-G-A)^ tracts, 
implying the C-terminal region of Hfq can interact with 
longer proximal face-binding RNA. 

The Sa Hfq distal face is restricted to (A-L)n motifs 

Using our TFQ approach we also tested the hypothesis that 
the Sa Hfq distal face binding motif is not an (R-L)n motif 
but a more restrictive (A-L)„ binding motif as suggested by 
our analyses of the Sa Hfq-A4 (41) and Bs Hfq-(AG)3A 
(47) complex structures. In these structures each has one 
adenine ring inserted into the R-site stacked against residue 
25, whereas the following 3'-nucleotide, an A or G, is 
stacked directly over the L-site forming residue Q31. If 
the R-site is only an adenosine-binding site then the se- 
quences 5'-AUAUAUA-3', (A-U)3A, and 5'-ACACACA 
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Figure 5. TFQ of Ec Hfq by two (A-R-N)n motif RNAs. (A) TFQ by 
(GGA)5. (B) TFQ by (AAG)5. The x-axis labels under each bar graph 
refer to the tryptophan-substituted residue within that Hfq protein. The 
percent quenching is shown on the y-axis. The bar graphs are coloured 
by location on Hfq and are defined at the bottom of the figure. The 
solid bar represents the percent quenching by 1 |.iM RNA, while the 
diagonal bar above the solid bar represents the observed quenching by 
4nM RNA. 



-3', (A-C)3 A, but not 5'-GGGGGGG-3' (G7), and 5'-GUG 
UGUG-3', (G-U)3G, should quench the distal side Trp 
replacements at positions 25 and 31. Both the (A-U)3A 
and (A-C)3A sequences quenched Sa mutants F25W and 
Q31W (Figure 6A and B). Further, the (A-C)3A sequence 
did not quench Lm F43 W, whereas the (A-U)3 A sequence 
quenches this mutant to a level equal to that seen for Sa 
F25W and Q31W. The abihty of (A-U)3A to bind to both 
faces is expected, as adenosine has been observed in crystal 
structures to bind on both faces of Sa Hfq and the (R-L)n 
distal face binding motif will accommodate uracil at the L- 
site. Also, based on previously reported binding data, it is 
known that Sa K33A significantly weakens but does not 
abohsh (A-U)3A binding, whereas it completely abohshes 
(A-C)3A binding (41). This suggests that the previously 
reported binding affinity of (A-U)3A for Sa Q33A reflects 
the binding affinity for the proximal rather than the distal 
face (41). By contrast, neither the Gyuor (G-U)3G sequence 
quenches the fluorescence of the F25W and Q31W 
proteins, but surprisingly the (G-U)3G sequence quenches 
Lm F43W, indicating that this sequence can bind to the 
proximal face (Figure 6C and D). G7 does not quench 
any residues and therefore does not interact with Hfq at 
even high concentration, a finding consistent with our 
earher fluorescence polarization-based binding studies 
(28). Thus, we conclude that the distal face (R-L)^ motif 
of Sa Hfq, Bs Hfq and most hkely Hfq proteins from most 
if not aU Gram-positive bacteria is a more restrictive (A-L)n 
motif. 

These quenching data also indicate that Lm Hfq and 
hkely Hfq proteins from other Gram-positive bacteria 
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quenching by 1 |iM RNA, while the diagonal bar above the sohd bar represents the observed quenching by 4 |iM RNA. 



can bind specific (R-L)n motifs to their proximal faces. 
Tlie observation that only (A-L),, sequences bind to the 
distal face demonstrates the impact that TFQ has on 
understanding how a particular RNA sequence binds 
Hfq. Until now it had been expected that sequences that 
fulfilled an (R-L)n motif would bind the distal face based 
on crystal structures and binding studies. These crystal 
structures all have adenosine bound in the R-site 
binding pocket and as such have led to the hypothesis 
that the R-site is really an adenine nucleotide selective 
site; however, since it is possible to model a guanine 
base in the R-site with some rearrangement of the 
pocket residues, the site could not be labelled unambigu- 
ously as adenine specific. Also, while the binding studies 
indicated that (R-L),, motifs bind Hfq, they provided 
insight into where the interaction occurs. TFQ thus 
allows us to redefine experimentally the proposed (R-L)^ 
distal face binding motif to a more restrictive (A-L)^ 
motif, which has been previously hypothesized but never 
tested (41). Finally, the abihty to detect RNA interacting 
with residue Q31W in Sa Hfq but not with the correspond- 
ing Ec Hfq residue illustrates the power of the TFQ 
approach, as modelling shows that residue Ec Hfq 
K31W would be close to the RNA but unable to 
interact with the base, whereas Sa Hfq residue Q31W 
stacks with the Linker site nucleotide and hence is 
quenched (Supplementary Figure S2B and D). 

Ec Hfq binds the 5'-UTR of Ec hfq mRNA on the 
proximal and distal sides 

Ec Hfq has been shown previously to regulate the amount 
of Ec hfq mRNA that is present in the cell (28) and to 
prevent translation initiation of the hfq mRNA, suggesting 



that the Hfq interaction with hfq mRNA blocks its 
ribosome binding site (29). Furthermore, previous 
studies on the Hfq:/?/^ interaction identified two binding 
sites for Hfq in the 5'-UTR of the hfq mRNA, which were 
termed 'site A' and 'site B' (29) (Figure 7A). To test the 
ability of our TFQ assay to determine the binding modes 
of longer pieces of RNA of physiological relevance, we 
assayed two 24-nucleotide fragments and one 64-nucleo- 
tide fragment of the 5'-UTR of Ec hfq mRNA. On the 
basis of the previous findings we tested the sequences 5'- 
AU U U U U UC AG A A UCG A A AGG U U C A- 3' , which 
contains site A and an adjacent stem loop; 5'-GCAUAU 
AAGGAAAAGAGAGAAUGG-3', which contains site 
B; and an RNA fragment that contains both sites A and 
B (Figure 7A). The site A containing ohgoribonucleotide 
significantly quenches the Q33W, F39W, F42W, G77W 
and Y83W proteins, indicating that site A binds both 
faces. However, because F42W has 2-fold greater TFQ 
and is 33% quenched, a preference for the proximal face 
is strongly suggested and the likely consequence of the 
AUgA stretch at its 5' end. The weaker distal face 
quenching is the hkely result of the presence of two 
AAN triplets within site A. The site A sequence also inter- 
acts with two residues of the C-terminal tail (Figure 7B), 
one of which, Y83W, was also seen to be quenched by 
(G-G-A)5. 

The site B containing ohgoribonucleotide quenches the 
FllW, Y25W, Q33W, F39W, F42W and Y83W proteins 
but prefers the distal face, as strong TFQ is observed using 
1 nM site B, whereas proximal face quenching is only seen 
to occur at four times this concentration (Figure 7C). 
Further, distal face quenching is 1.7-fold greater than 
proximal face quenching when 4|iM is used. Regardless, 
at higher concentrations, site B appears to bind to both 
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each bar graph refer to the tryptophan-substituted residue within that Hfq protein The percent quenching is shown on the y-axis. The bar graphs are 
coloured by location on Hfq and are defined at the bottom of the figure. The solid bar represents the percent quenching by 1 |j.M RNA, while the 
diagonal bar above the solid bar represents the percent quenching by 4|.iM RNA. 



faces and to interact with at least one residue of the 
C-terminal tail (Figure 7C). Using the 64-nucleotide 5'- 
UTR of the hfq mRNA, we observed that when Site A 
and Site B are connected physically, there is binding to 
both faces as residues Q33W and F42W are both 
quenched (Figure 7D). We also see increased quenching 
of the lateral and charged cleft residues, which are located 
in the N-terminal helix, and the C-terminal residues. The 
charged cleft residue, R17W, has not been seen to be 
quenched in any of our previous experiments, indicating 
that longer and more structured RNA sequences may be 
required for binding to occur at this proposed lateral/rim 
binding site (59). These data illustrate that Hfq is capable 
of binding a longer RNA to both sites at the same time, 
supporting current models that hypothesize Hfq can bind 
to multiple, different RNA sequences simultaneously with 
U-rich sRNAs preferring the proximal face and (A-A-N)n 
containing mRNAs binding to the distal face (Figure 8A). 
However, we currently cannot rule out the possibihty that 
there are two Hfq populations, one with Site A bound to 
the proximal face only and the other with Site B bound to 
the distal face only (Figure 8B). Structural studies are 
underway to discern the binding mode of the 5'-UTR of 
hfq mRNA. 



CONCLUSION AND PERSPECTIVE 

We have presented data using a straightforward method 
that can be used to assess the preferred binding site(s) of 
particular RNA sequences to Ec, Sa and Lm Hfq and can 
be readily adapted to understand the RNA binding modes 
of Hfq from other Gram-negative and Gram-positive 
bacteria. By placing a reasonable number of structure- 
guided tryptophan residues in a series of specific locations 
we have shown that TFQ properly identifies where 'con- 
trol' RNA sequences bind (Figure 3). Using TFQ we have 
demonstrated that the (A-R-N)n distal face-binding motif 
of Ec Hfq and likely Hfq proteins from all Gram-negative 
bacteria is a more restrictive (A-A-N)^ binding motif, and 
the (R-L)„ distal face-binding motif of Sa Hfq and pre- 
sumably all Gram-positive Hfq homologues is a more re- 
strictive (A-L)„ binding motif. Surprisingly, the (A-R-N)n 
motif sequence (G-G-A)5 and the (R-L)^ motif sequence 
(G-U)3 G still bind but to the proximal faces of the Ec Hfq 
and Sa/Lm Hfq proteins, respectively. Their detailed 
binding mechanisms will require high-resolution structural 
studies that are underway. Regardless, our TFQ findings 
can be used to hone computational searches for potential 
Hfq-binding RNA sequences and to assign their likely 
binding sites. We have illustrated also that our TFQ 
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Figure 8. Potential models for Ec Hfq binding to the 5'-UTR of hfq 
mRNA. (A) hfq mRNA binding such that a single niRNA uses all sets 
of identified binding sites to interact with both faces of Ec Hfq with the 
linker between the two binding sites wrapping around the outside of the 
Hfq hexamer. (B) Two hfq mRNA binding to the proximal and distal 
faces of a single Hfq hexamer. The 5'-UTR of the hfq mRNA is shown 
as a ribbon with its 5' and 3' ends labelled. The 5' end of the mRNA 
contains Site A, which should wrap about the proximal pore and lead 
to the rim/lateral surface. The 3' end of the mRNA contains Site B, 
which has a preference for the distal face and C-terminal region of Hfq. 
The 5'-UTR of the hfq mRNA is modelled onto a surface rendering of 
Hfq generated in Pymol using PDB ID: 1HK9 (33). 



approach can be used to study interactions between Hfq 
and longer, physiologically relevant RNA sequences. 
Specifically, Ec Hfq is capable of binding to hfq mRNA 
site A and site B and that Hfq uses both faces and its 
C-terminus to bind the 64-nucleotide 5'-UTR of this 
niRNA simultaneously. One potential limitation of this 
approach is the inability to synthesize either biochemically 
or chemically longer RNA sequences in the quantities that 
are needed for each quenching study. However, the upper 
liiTtit of such biosynthesis is well beyond the 64 nucleotides 
of the 5'-UTR of hfcj, allowing the evaluation of the 
binding of full-length sRNAs and the longer, relevant 
parts of their mRNA targets to Hfq both singly and in 
combination. The creation of more tryptophan- 
substituted Hfq proteins should refine further the ability 
of TFQ to reveal primary and any secondary RNA 
binding sites. TFQ has also revealed that several 
residues of the C-terminal region indeed interact with 
certain RNAs and suggests a functional importance of 
this region in riboregulation for at least some RNA 
species. Finally, this method is not limited to the identifi- 
cation of RNA binding loci and should be useful in the 
analysis of Hfq-protein binding. 
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